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INTRODUCTION 


I . 

The  Navy  Experimental  Diving  Unit  (NEDU)  and  the  Naval 
Coastal  Systems  Center  (NCSC)  in  Panama  City,  Florida  are 
actively  engaged  in  improvement  of  the  Navy  diving  program. 
Numerous  tests  have  been  conducted,  both  manned  and  simulated, 
to  assess  the  design  considerations  for  axial  flow  canisters 
u  d  to  remove  excess  carbon  dioxide  from  a  diver's  breathing 
medium. 

^ In  May,  1983,  NCSC  published  a  technical  manual  entitled 
"Design  Guidelines  for  Carbon  Dioxide  Scrubbers,"  which  was 
the  result  of  a  large  series  of  experiments  conducted  at  NCSC 
to  isolate  the  effects  of  various  parameters  on  canister  per¬ 
formance  [Ref.  1] .  The  data  from  these  experiments  was  obtained 
and  is  provided  as  Appendix  A  to  this  report. 

The  object  of  this  thesis  are: 

1)  to  provid  background  for  the  development  of  carbon 
dioxide  scrubbing  canisters, 

2)  to  present  a  review  of  the  work  of  Nuckols ,  Purer, 
and  Deason  in  developing  Reference  1, 

3)  to  utilize  the  techniques  of  multiple  regression 
analysis  to  identify  those  parameters  of  interest 
in  predicting  the  useful  life  of  a  canister,  and 

4)  to  develop  a  model  that  will  be  useful  in  predicting 
the  performance  of  axial  flow  carbon  dioxide  absorptive 
canisters . 

A.  BACKGROUND 

As  early  as  1878,  when  H.A.  Fleuss,  a  British  diver,  uti¬ 
lized  a  workable  solution  of  caustic  potash  to  remove  carbon 
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dioxide  in  a  closed  circuit  self-contained  underwater  breathing 
apparatus  (SCUBA) ,  chemical  absorbing  agents  were  being  inves¬ 
tigated  for  removal  of  carbon  dioxide  (C02)  from  a  breathing 
medium  [Ref.  1:  pp.  1-9].  Although  the  worldwide  development 
of  mixed-gas  underwater  breathing  apparatus  (UBA)  dates  from 
1912,  its  development  in  the  U.S.  Navy  followed  World  War  II. 
Mixed-gas  UBA  represents  systems  that  employ  a  lightweight 
mixed-gas  supply  and  a  diver-worn  gas  recirculation  system  to 
remove  carbon  dioxide. 

With  open  circuit  SCUBA,  diving  depth  and  duration  are 
sharply  restricted  by  the  low  efficiency  of  gas  utilization 
resulting  from  the  complete  discharge  of  exhaled  gases.  Approx¬ 
imately  five  percent  of  oxygen  consumed  is  actually  utilized  by 
the  diver  at  the  surface  and  this  percentage  decreases  with 
increasing  depth  [Ref.  2:  pp.  11-1],  Consequently,  in  order 
to  conserve  gas  supply  and  extend  underwater  duration,  it  is 
essential  to  improve  this  efficiency  of  gas  utilization.  This 
is  accomplished  by  recirculating  the  diver's  breathing  medium 
for  reuse,  removing  the  C02  produced  by  metabolic  action  in  the 
body . 

In  1965,  Lcdr  M.W.  Goodman,  MC,  USN,  writing  in  research 
report  3-64  for  the  Navy  Experimental  Diving  Unit,  addressed 
the  quantitative  considerations  of  design  and  performance  of 
cylindrical  canisters  used  to  scrub  C02  in  a  closed  circuit 
system  [Ref.  3].  He  stated  that  research  and  developmental 
efforts  related  to  the  problem  of  C02  removal  form  the  gaseous 
atmosphere  of  mobile  closed-circuit  life-support  systems  have 
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been  devoted  almost  exclusively  to  submersibles  and  aerospace 
vehicle  crew  modules.  There  had  been  no  fruitful  application 
of  procedure  or  hardware  to  the  diving  community,  which  needed 
compact  and  streamlined  packages  to  interface  with  a  non-propelled 
diver.  Goodman's  work  was  essentially  an  extension  of  the  work 
of  H.W.  Huseby  [Ref.  4]  and  G.J.  Duffner  [Ref.  5]  who  developed 
design  criteria  for  a  SCUBA  carbon  dioxide  removal  canister 
presuming  satisfactory  performance  for  180  minutes  at  30  feet 
and  30  minutes  at  180  feet.  The  problem  that  Goodman  addressed 
was  to  determine  the  factors  which  govern  the  function  of  SCUBA 
CC>2  absorbent  canisters,  and  the  quantitative  manner  of  their 
interdependence.  Utilizing  data  obtained  from  dives  using 
swimmers  and  experiments  using  a  mechanical  respirator,  he 
was  able  to  relate  breathing  resistance,  duration  of  useful 
canister  life  and  efficiency  of  absorbent  utilization,  all  to 
canister  geometry. 

This  development  of  desi  considerations  for  the  construc¬ 
tion  of  CC>2  absorbent  canist  has  been  an  ongoing  process 
at  both  the  Navy  Experimental  Diving  Unit  and  the  Naval  Coastal 
Systems  Center,  Panama  City,  Florida.  The  most  recent  work  has 
been  conducted  by  M.L.  Nuckols,  A.  Purer,  and  G.A.  Deason  of 
Naval  Coastal  Systems  Center. 

Starting  in  late  1981,  they  started  a  series  of  controlled 
laboratory  tests  to  isolate  the  effects  of  environmental  and 
geometric  parameters  on  canister  absorption  efficiency.  Their 
ultimate  goal  was  to  develop  a  manual  which  would  enable 
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engineers  to  predict  the  performance  of  prototype  designs  of 
axial  flow  canisters.  This  goal  was  realized  with  the  publi¬ 
cation  of  NCSC  Technical  Manual  4110-1-83  [Ref.  1].  This  manual 
presents  design  data  and  guidelines  to  predict  the  performance 
of  axial  flow  carbon  dioxide  canister  designs  using  alkali  metal 
hydroxide  absorbers.  In  addition,  methods  are  developed  to 
evaluate  efficiencies  of  different  carbon  dioxide  absorbents  in 
order  to  characterize  the  absorption  capability  of  a  soda-lime 
type  absorbent  presently  used  by  the  U.S.  Navy. 

B.  PHYSIOLOGICAL  EFFECTS  OF  CARBON  DIOXIDE 

This  section  presents  the  physiological  effects  on  the 
body  of  increased  concentrations  of  carbon  dioxide  in  inhaled 
gases  and  is  derived  from  the  NCSC  TECHMAN  [Ref.  1:  pp.  2-7]. 

The  process  of  breathing  is  characterized  by  the  inhalation 
of  oxygen  and  the  exhalation  of  a  nearly  equal  volume  of  C02- 
The  ratio  of  CO 2  produced  to  oxygen  consumed  is  called  the 
respiratory  quotient  (RQ)  and  is  assumed  to  be  0.85  for  purposes 
of  canister  design.  Various  respiratory  quotients  can  be  de¬ 
rived  from  Table  1  [Ref.  1:  p.  3]  for  various  swim  speeds. 
Respiratory  minute  volume  (RMV) ,  is  defined  as  the  volume  of 
gas  moved  in  and  out  of  the  lungs  per  minute,  while  values  for 
C02  produced  are  based  on  the  density  of  CC>2  at  standard 
temperature  and  pressure. 

In  a  closed  diving  system  where  C02  is  included  in  the 
inhaled  gas,  the  partial  pressure  of  CC>2  in  the  blood  increases 
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TABLE  1 


02  CONSUMPTION  AND  C02  PRODUCED 
FOR  VARIOUS  SWIM  SPEEDS 


Swim  Speed 
(kt) 

02  Consumption 

(Ipm) 

RMV 

(Ipm) 

g/min 

C02  Produced 

1pm  %  C02 

Low 

0.26 

7.0 

0.44 

0.221 

3.57 

Rest 

Mean 

0.34 

9.0 

0.57 

0.289 

3.64 

High 

0.42 

11.0 

0.70 

0.357 

3.68 

Low 

0.67 

17.0 

1  .  14 

0.576 

3.84 

0.5 

Mean 

0.82 

20.0 

1.41 

0.714 

4.02 

High 

1.00 

24.5 

1.72 

0.870 

4.05 

Low 

0.96 

24.5 

1.65 

0.835 

3.87 

0.7 

Mean 

1 .  14 

25.0 

1.96 

0.991 

4. 10 

High 

1.35 

34.0 

2.43 

1.228 

4.50 

Low 

1.13 

25.0 

1.94 

0.982 

4.19 

0.9 

Mean 

1.33 

37.0 

2.75 

1.391 

4.26 

High 

1.90 

49.0 

3.49 

1.768 

4.45 

Low 

1.34 

34.0 

2.43 

1.219 

4.03 

1.0 

Mean 

1.83 

44.0 

3.36 

1.701 

4.38 

High 

2.26 

55.0 

4.33 

2.190 

4.51 

Low 

1.87 

45.0 

3.43 

1.738 

4.39 

1.2 

Mean 

2.50 

60.0 

4.79 

2.425 

4.60 

High 

3.03 

66.0 

5.99 

3.030 

5.21 

causing  the  body  to  increase  the  rate  of  respiration.  Exces¬ 
sive  amounts  of  CC>2  in  an  inhaled  gas,  based  on  partial  pres¬ 
sure,  can  result  in  toxic  effects  dependent  on  exposure  times 
Figure  1  depicts  the  physiological  effects  of  CO 2  for  differ¬ 
ent  concentrations  and  exposure  periods  [Ref.  1:  p.  5].  The 
fractional  C02  concentration  in  a  gas  stream  (F^  )  can  be 
defined  as 

FC02  =  [(Vo2  xRQ)  ^  (Q  xPB)]  T  28*32 

where  V_  is  the  diver's  oxygen  consumption  rate  in  liters 
2 

per  minute,  RQ  is  equal  to  0.85,  Q  is  the  volumetric  flow 


rate  to  the  canister  in  cubic  feet  per  minute,  and  PB  is  the 
environmental  pressure  in  atmospheres  absolute. 
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Figure  1.  Physiological  Effects  of  CO 


Since  the  units  for  expressing  partial  pressure  are  usually 
mmHg,  the  partial  pressure  of  C02  (Pc0  )  can  be  computed  by 


P  =  f  *px760. 
^C02  C02  B 


Figure  2  [Ref.  1:  p.  6]  shows  C02  tolerance  zones  as  a  function 
of  percentage  of  C02  and  depth  at  partial  pressures  taken  from 
Figure  1  for  a  1-hour  exposure  period.  The  physiological 
effects  of  C02  depend  upon  its  partial  pressure  in  the  breathing 
gas  and  thus  in  the  blood.  As  diving  depth  increases,  the 
percentage  of  C02  in  the  breathing  gas  that  can  be  tolerated 
decreases . 


errors,  natural  variability  of  the  predictors  and  the  effect 
of  not  including  variables  which  affect  the  response.  The 
residuals,  e,  are  assumed  to  be  normally  and  independently 
distributed  with  zero  mean  and  common  variance. 

The  method  of  least  squares  minimizes  the  residual  sum  of 
squares  defined  by 

N  N  ^ 

RSS  =  >  e  =  7  (Y.  -  Y.  )  , 

•  i  i  i  l 

1=1  1=1 

where  the  hat  notation  signifies  an  estimated  or  predicted 
value,  i.e.,  Y  represents  the  predicted  values  of  the  response 
variable.  The  sample  variance  of  Y  given  X  can  be  defined  as 

N 

Sy|X  =  £  (Y.  -  Yi)2/(N-2)  , 

and  represents  an  unbiased  estimate  of  the  population  variance. 
The  denominator,  N-2,  represents  degrees  of  freedom  defined  as 
the  number  of  cases  minus  the  number  of  parameters  in  the  model. 
The  square  root  of  the  sample  variance  of  Y  given  X  is  called 
the  standard  error  of  regression  and  will  be  used  later  to  com¬ 
pare  models  generated  in  multiple  regression  where  its  definition 
is  similar. 

By  defining 

-  —  2  ^  —  2 
SXX  =  )  (X.  -X)  ,  SYY  =  >  (Y.  -Y)  ,  and 

i=l  1  i=l  1 

N 

SXY  =  /  (X .  -  X)  ( Y .  -  Y)  , 

1  =  1  1  1 


25 


must  be  made  based  on  the  experience  and  knowledge  of  the 
analyst . 

Regression  analysis  can  be  separated  into  two  stages,  the 
first  of  which  is  aggregate  analysis  where  the  intent  is  to 
combine  the  data  and  develop  a  model.  The  second  stage  is 
case  analysis  where  the  data  are  used  to  examine  the  suita¬ 
bility  and  correctness  of  the  fitted  model.  This  chapter  will 
present  a  brief  review  of  regression  theory  and  then  develop 
models  to  predict  effective  cannister  life,  represented  by  the 
variable,  TIME. 

A.  THEORY  OF  REGRESSION 

Although  many  estimation  procedures  exist  for  obtaining 
estimates  of  parameters  in  a  model,  this  thesis  will  use  the 
most  common  one  of  least  squares.  The  theory  for  this  develop¬ 
ment  is  extracted  from  References  11  and  12.  In  simple  linear 
regression,  a  model  is  constructed  to  fit  a  straight  like  through 
a  set  of  data  and  is  of  the  form:  Y  =  a  +  8X  +  c,  where  Y  is 
the  response  variable,  X  the  predictor  variable,  3  the  slope 
of  the  line,  x  the  Y-intercept  and  e  the  error  term.  The 
parameters  of  the  line  are  a  and  3.  Differences  in  the  values 
predicted  by  the  model  and  the  actual  observed  values  are 
called  statistical  errors,  or  residuals,  and  have  both  fixed 
and  random  components.  The  fixed  components  result  from  fitting 
a  straight  line  model  to  a  set  of  data  where  the  true  relation¬ 
ship  is  curvilinear.  Random  effects  result  from  measurement 


Although  is  affected  by  pressure,  it  remains  fairly 
constant  over  the  range  of  depths  represented  by  the  data  and 
is  therefore  excluded  [Ref.  1:  p.  15] .  The  absorbent  particle 
diameter  remains  constant  at  0.14  inches  and  also  is  excluded. 
Density  and  viscosity  are  dependent  on  the  gas  mix,  operating 
pressure  and  temperature.  These  variables  are  required  to 
calculate  Reynold's  number,  which  is  used  to  define  efficiency, 
however,  it  is  the  attempt  of  this  thesis  to  dervie  a  relation¬ 
ship  for  canister  breakthrough  time  as  a  function  of  the  data 
gathered  by  experimentation. 

A  multiple  regression  analysis  was  performed  on  this 
matrix  of  data  with  TIME  as  the  dependent  or  predicted  variable. 
The  purpose  of  this  analysis  was  to  examine  the  relationships 
existing  among  the  variables  and  to  develop  a  model  that  would 
be  useful  in  making  predictions  for  the  dependent  variable  within 
the  ranges  of  the  independent  variables.  The  most  common 
reasons  for  performing  a  regression  analysis  are  to  provide  a 
description  of  the  relationships  between  the  predictor  varia¬ 
bles  and  the  dependent  variable  and  provide  a  model  for  predic¬ 
tions  of  future  values.  It  is  important  to  understand  the 
difference  between  interpolation  and  extrapolation  when  predict¬ 
ing  values.  Interpolation  occurs  when  the  predictor  variables 
are  within  the  multidimensional  space  observed  in  construction 
of  the  model,  while  extrapolation  represents  predictions  using 
values  of  variables  which  are  outside  this  space.  Regression 
analysis  provides  a  basis  for  interpolation,  while  extrapolations 
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tB  =  f (RH, TEMP, WT,LENTH,DIAM, C02, F10W, PRES, HYDRA)  . 

These  nine  variables  are  the  same  that  are  utilized  in  the 
process  of  calculating  canister  efficiency  and  theoretical 
canister  life  to  obtain  actual  canister  life  as  described  in 
the  NCSC  Techman  with  the  exception  of  p  (gas  density) ,  p  (gas 
viscosity) ,  e  (absorbent  particle  diameter) ,  Dv  (gas  mass 
diffusivity) ,  and  chemical  hydration  level  (HA) .  Absorbent 
moisture  content  or  hydration  level  was  assumed  to  be  fixed 
from  the  manufacturer,  however,  samples  of  varying  moisture 
content  were  constructed  and  tested  at  NCSC  and  consequently 
hydration  level  is  included  as  the  independent  variable  HYDRA. 


TABLE  4 

RANGES  OF  THE  INDEPENDENT  VARIABLES 


MEAN 

VAR 

MIN 

MAX 

FH  (%) 

56.764 

2301.2 

0 

100 

TEMP  ( °F) 

66.472 

91.8 

35 

80 

WT  (gm) 

19.994 

4049.8 

2.05 

383.56 

LENTO  (in) 

2.679 

1.2 

1.078 

16 

DIAM  (an) 

1.489 

2.8 

0.975 

9.446 

ao2  (%) 

1.149 

1.3 

0.031 

4.35 

FLCW  (cc/min) 

1039.887 

10,965,549.7 

23.837 

31,185.74 

PRESS  (ata) 

4.202 

54.6 

1 

32 

HYDRA  (%) 

13.386 

9.5 

0.6 

14.1 
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II.  MODEL  DEVELOPMENT 


The  data  generated  by  the  experiments  of  Nuckols,  Purer, 
and  Deason  are  based  on  steady  flow  through  axial  canisters. 

A  diver's  respiratory  cycle  actually  represents  an  intermit¬ 
tent  or  pulsatile  flow,  the  effects  of  which  are  discussed  in 
Reference  10. 

The  data  is  in  matrix  form  and  is  provided  in  Appendix  A. 
There  are  377  rows,  representing  independent  trials  of  the 
experiment  and  10  columns,  representing  the  9  independent  and 
1  dependent  variables.  The  columns  from  left  to  right  repre¬ 
sent  relative  humidity  in  percent  (RH) ,  temperature  in  degrees 
Fahrenheit  (TEMP) ,  weight  of  the  chemical  absorbent  in  grams 
(WT) ,  time  to  breakthrough  in  minutes  (TIME) ,  canister  length 
in  inches  (LENTH) ,  canister  diameter  in  centimeters  (DIAM) , 
percent  by  volume  of  C02  in  the  inlet  gas  at  depth  (C02), 
surface  rate  of  flow  of  the  inlet  gas  in  cubic  centimeters  per 
minute  (FLOW) ,  absolute  operating  pressure  expressed  in  atmos¬ 
pheres  (PRESS),  and  the  hydration  level  of  the  chemical  absor¬ 
bent  expressed  as  a  percentage  (HYDRA) . 

The  ranges  of  the  independent  variables  are  provided  in 
Table  4,  while  statistics  on  each  variable  or  column  of  the 
matrix,  are  provided  in  Appendix  B.  The  data  set  of  Appendix 
A  allows  for  actual  canister  breakthrough  time,  t_, ,  to  be 

D 

expressed  as  a  function  of  as  many  as  nine  variables,  represented 
as  : 
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In  other  words,  efficiency,  n ,  is  defined  to  be  a  function  of 
five  dimensionless  groups  represented  by  Reynold's  number, 
temperature,  relative  humidity,  a  ratio  of  absorbent  particle 
diameter  to  canister  diameter,  and  a  ratio  of  canister  length 
to  canister  diameter. 

E.  EXPERIMENTAL  DESIGN 

The  experimental  set-up  depicted  in  Figure  3  was  utilized 
by  Nuckols,  Purer,  and  Deason  to  derive  the  data  set  relating 
canister  breakthrough  time  to  various  parameters  [Ref.  9: 
pp.  3-4].  Helium  was  utilized  as  the  main  ingredient  of  the 
inlet  gas  because  of  its  high  thermal  conductivity  which  allowed 
for  quick  dissipation  of  the  heat  of  reaction  resulting  in  a 
more  uniform  distribution  of  temperature  across  the  test 
canister.  Although  the  diameter  of  the  test  cell  was  varied, 
it  was  kept  small  to  insure  a  constant  temperature  throughout 
the  absorbent.  Relative  humidity  was  varied  by  passing  inlet 
gas  through  a  bubble  tower.  Temperature  of  the  test  cell  was 
varied  from  35  F  to  70  F,  while  weight  of  the  aosorbent  depended 
on  cell  dimensions  and  packing  density.  The  percentage  of  CC>2 
in  the  inlet  gas  was  varied  over  a  realistic  range  for  divers, 
while  the  flow  rate  ranged  as  high  as  31,185.35  cubic  centimeters 
per  minute  measured  at  the  surface.  Absolute  pressure  was 
varied  from  one  atmosphere  (surface)  to  thirty-two  atmospheres 
(1023  feet  dive  depth).  Various  moisture  contents  of  Sodasorb 
were  constructed  by  adding  water  or  baking  the  absorbent  [Ref.  1], 
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range  of  the  dimensionless  groups.  Since  the  number  of  dimen¬ 
sionless  groups  is  less  than  the  original  number  of  variables, 
considerable  time  and  expense  is  saved  during  the  process  of 
experimentation  [Ref.  8]. 

Canister  efficiency,  n#  was  defined  as  a  ratio  of  actual 
breakthrough  time,  t^,  to  theoretical  breakthrough  time, 
tTH:  n  =  Theoretical  breakthrough  time  is  computed 

by  dividing  the  absorption  capacity  of  the  chemical  absorbent 
by  the  rate  of  CO ^  delivered  to  the  canister  as  follows: 


_  _  AW 

where : 

A  =  mass  of  CO2  absorbed  per  mass  of  absorbent; 

W  =  mass  of  absorbent; 

Q  =  absolute  volumetric  flow  rate; 

C  =  CO2  volume  fraction  in  gas  stream  at  depth; 

=  density  of  CO^  at  depth  for  given  temperature. 


The  dimensional  analysis  resulted  in  an  expression  for  canister 
efficiency,  n,  of: 


n  =  f (ReD,T,H,e/D,L/D) 


where  Ren  is  the  Reynold's  number  and  is  defined  as: 


Re. 


p  Ve 

P 
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TABLE  3 


PARAMETERS  OF  CANISTER  C02  ABSORPTION 


Parameter 

Identification 

Dimensions 

S 

Breakthrough  time 

t 

V 

Gas  stream  velocity 

L/t 

T 

Gas  temperature 

T 

P 

Gas  density 

m/L3 

P 

Gas  viscosity 

m/Lt 

C 

C02  concentration 

l3/l3 

H 

Gas  stream  water  vapor 
content 

l3/l3 

HA 

Absorbent  water  content 

l3/l3 

W 

Absorbent  mass 

■ 

e 

Particle  size 

L 

L 

Canister  length 

L 

D 

Canister  diameter 

L 

D 

V 

Hass  diffusivity 

L*/t 

A 

Absorption  capacity 

m/m 

R 

Reaction  rate 

m/t 

Bade  Unit* 

t  =  Time  a  =  Mas* 

L  =  Length  T  =  Temperature 


performed,  the  theory  of  which  is  described  in  Reference  8. 
Dimensional  analysis  is  useful  when  the  variables  involved  in 
a  process  are  known,  but  the  relationships  among  the  variables 
aie  not  known.  If  the  variables  can  be  reexpressed  by  only  a 
few  dimensionless  groups,  then  relationships  between  the  group 
can  be  derived  which  are  applicable  to  all  cases  within  the 
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chemical  absorbent.  This  radial  movement  of  gas  is  a  function 
of  the  C02  concentration,  the  mass  diffusivity  of  C02 ,  the 
distance  from  the  centerline  of  the  canister  to  the  absorbent 
surface  expressed  in  terms  of  canister  diameter  and  absorbent 
particle  size,  and  the  turbulence  in  the  gas  stream  as  a 
function  of  gas  density  and  viscosity  [Ref.  1] . 

As  discussed  earlier,  water  is  necessary  for  the  completion 
of  the  absorption  process  and  may  be  provided  in  the  form  of 
water  vapor  in  the  exhaled  gas  of  the  diver  or  in  the  chemical 
absorbent  itself.  Once  the  chemical  absorption  process  has 
been  started,  the  reaction  rate  must  be  fast  enough  to  keep  up 
with  the  rate  at  which  CO^  is  arriving.  The  reaction  rate  of 
the  process  is  highly  influenced  by  temperature  [Ref.  1]. 

Table  3  lists  the  various  parameters  of  interest,  which, 
although  not  all  inclusive,  were  presented  in  Reference  1  as 
having  the  most  influence  on  canister  duration.  As  the  table 
depicts,  canister  duration  is  then  a  function  of  fourteen 
variables : 


tn  =  f (V,T,p,y,C,H,HA,W,e,L,D,A,D„,R) 

This  functional  relationship  was  simplified  by  observing 
that  the  absorbent's  reaction  rate  (R) ,  capacity  to  absorb 
CO 2  (A) ,  and  moisture  content  (HA)  were  beyond  the  control  of 
the  user,  reducing  the  number  of  independent  variables  to  eleven. 
Next,  a  dimensional  analysis  of  the  resulting  relationship  was 


16 


TABLE  2 


PROPERTIES  OF  HYDROXIDES 


Chemical 

Formula 

Molecular 

Weight 

CC>2  Capacity 
gmCO,v/100  gm 

Solubility  in 
Water  grrydOO  cc 

Causticity 

Potassium 

Hydroxide 

KCH 

56.1 

39.2 

115 

1.0 

Sodium 

Hydroxide 

NaOH 

40.0 

55.0 

109 

0.95 

Lithium 

Hydroxide 

LiCH 

23.9 

91.9 

13 

0.11 

Berium 

Hydroxide 

Ba(OH) 

171.4 

25.7 

5.6 

0.05 

Calcium 

Hydroxide 

Ca(OH)  2 

74.1 

59.4 

0.183 

0.0015 

Magnesium 

Hydroxide 

Mg(CH)2 

58.3 

75.5 

0.0009 

7.0E-7 

canister  exceeds  physiological  limits  for  human  respiration. 

The  value  of  0.5  percent  C02  by  volume  is  generally  accepted 
for  experimental  use.  Thus,  t  provides  a  measure  of  a  canister's 
life  or  effective  duration. 

The  absorption  process  within  a  canister  involves  three 
separate  processes  which  must  work  together  in  order  for  the 
canister  to  perform  the  scrubbing  process.  These  processes 
are  defined  as  "mass  transfer  of  CC>2  from  the  gas  stream  to 
the  absorbent  surface,  absorption  at  the  surface,  and,  finally, 
chemical  absorption"  [Ref.  1:  p.  12].  In  an  axial  flow 
canister,  if  the  speed  of  the  gas  through  the  canister  is  too 
great,  or  if  the  length  of  the  canister  is  too  short,  the  mole¬ 
cules  of  CC>2  will  not  have  time  to  move  radially  through  the 
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while  excessive  moisture  will  tend  to  flood  the  canister,  block¬ 
ing  the  critical  contact  between  the  CC>2  laden  gas  and  the 
chemical  absorbent  particles  and  also,  inhibit  or  slow  down  the 
reaction.  Nuckols,  Purer,  and  Deason  report  that,  "the  moisture 
content  in  the  canister  can  change  during  its  use  due  to  an 
imbalance  in  the  water  content  of  the  incoming  and  outgoing 
gas  streams.  Gas  saturated  with  water  normally  would  result 
in  an  increase  in  moisture;  however,  if  the  heat  of  reaction 
is  not  dissipated,  high  temperature  areas  will  result  in  the 
evaporation  of  water  from  certain  sections  of  the  canister  thus 
rendering  such  zones  ineffective  for  carbon  dioxide  removal" 

[Ref .  7 :  p.  59 ]  . 

Although  various  hydroxides  are  available  for  use  as  chemi¬ 
cal  absorbents,  the  U.S.  Navy  has  decided  to  use  absorbents 
based  on  calcium  hydroxide.  Table  2,  comprised  of  data  from 
Reference  1,  lists  various  hydroxides  along  with  various 
properties.  It  is  of  interest  to  note  that  while  calcium 
carbonate  does  not  perform  as  well  as  lithium  hydroxide,  its 
causticity  relative  to  potassium  hydroxide  is  lower. 

D.  PARAMETERS  OF  CANISTER  C02  ABSORPTION 

This  section  is  a  brief  overview  of  the  work  of  Nuckols, 
Purer,  and  Deason  [Ref.  1]  in  identifying  the  most  important 
or  influential  variables  in  the  prediction  of  canister  break¬ 
through  time,  t.  Canister  breakthrough  time  is  defined  as  the 
time  at  which  the  C0_  concentration  in  the  gas  exiting  the 
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Sodasorb,  a  registered  trademark  name  [Ref.  6],  It  consists  of 
calcium  hydroxide,  14  to  19  percent  moisture  by  weight,  less 
than  five  percent  sodium  hydroxide,  potassium  hydroxide  or 
barium  hydroxide,  and  ethyl  violet  as  a  sensitive  acid/base 
indicator.  Ethyl  violet  is  added  to  the  composition  to  indi¬ 
cate  whether  the  absorbent  chemical  has  been  used  or  not.  As 
the  CO 2  is  absorbed,  the  resulting  pH  change  causes  the  granules 
to  turn  from  colorless  to  blue. 

The  actual  process  of  C02  removal  by  Sodasorb  is  a  chemical 
reaction.  The  CC>2  forms  carbonic  acid  which  combines  with 
hydroxide  to  form  sodium  carbonate  and  water.  Sodium  carbonate 

a 

then  reacts  with  lime  to  form  calcium  carbonate.  The  entire 
process  is  characterized  by  the  following  equations  [Ref.  7: 
p.  58]: 

C02  +  H20  *  h2C03 

2H2C03  +  2NaOH  +  2K0H  -  Na2C03  +  4H20  +  K2C03 
2Ca  (OH )  2  +  Na2C03  +  K2C03  -►  2CaCC>3  +  2NaOH  +  2K0H  . 

It  is  important  to  note  that  the  chemical  reactions  will  reach 
their  completion  provided  that  the  C02  laden  gas  remains  inside 
the  canister  long  enough  and  adequate  mass  exchange  can  be  made 
between  the  absorbent  and  the  gas.  Although  water  is  a  byproduct 
of  the  above  process,  water  is  required  to  initiate  the  above 
reactions.  Insufficient  moisture  will  then  inhibit  the  process 
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Figure  2.  CO  Tolerance  Zones 

In  closed-circuit  dive  systems,  all  exhaled  gases  can  be 
rebreathed  by  passing  the  exhaled  gas  through  a  canister  con 
taining  a  chemical  C02  absorber  before  the  gas  is  inhaled. 
Oxygen  that  is  used  in  the  process  of  metabolism  is  replaced 
by  oxygen  from  a  bottle  carried  by  the  diver. 


C.  CHEMICAL  PROCESS  OF  C02  ABSORPTION 

The  C02  absorbent  used  throughout  the  studies  conducted 
at  Naval  Coastal  Systems  Center  was  High  Performance  (HP) 
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ere  the  bar  denote  a  sample  average,  the  sum  of  squares  due 
co  regression  can  be  defined  as 

SSR  =  ( SYY  -  RSS )  =  (SXY)2/SXX  . 

In  simple  regression,  the  degrees  of  freedom  (d.f.)  associated 
with  SSR  is  one. 

The  coefficient  of  determination  for  multiple  regression 

is  defined  as  F  =  SSR/SYY  and  represents  a  scale  free  one- 

number  summary  of  the  strength  of  the  relationship  between  the 

X  and  Y  of  the  data.  It  is  easily  interpreted  as  a  percentage 

of  variability  in  the  criterion  that  is  explained  by  the  model. 

It  also  is  the  same  as  the  square  of  the  sample  correlation 

2 

between  X  and  Y.  R  has  long  been  used  to  compare  models  of 
the  same  data  and  its  value  in  comparisons  will  be  discussed 
later . 

Multiple  regression  implies  that  two  or  more  predictor 
variables  are  used  to  model  the  response  variable.  The  basic 
model  is:  Y  =  a  +  3-^X^  +  62*2  +  ...  +  S^X^  +  e  where  a  and  8 
are  unknown  parameters,  e  represents  statistical  error,  Y  is 
the  response  variable  and  the  X's  are  the  independent  variables. 
The  "real  world"  geometric  interpretation  of  this  model  breaks 
down  as  it  is  the  equation  of  a  K-dimensional  hyperplane  in 
( K+l )  dimensional  space. 

Analysis  of  variance  (ANOVA)  'rovides  a  method  whereby 
the  fit  of  various  models  to  the  ume  set  of  data  can  be 
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compared.  The  F  statistic  is  defined  as  F  =  MSR/RMS,  where 
MSR  is  the  regression  mean  square  obtained  by  dividing  SSR 
by  d.f.  and  RMS  is  the  residual  mean  square  obtained  by  dividing 
RSS  by  d.f.  It  compares  the  variation  due  to  regression  to 
variation  due  to  error.  The  F  statistic  is  used  to  test  the 
hypothesis  that  all  of  the  regression  weights  in  the  model  are 
equal  to  zero.  If  the  calculated  F  value  exceeds  the  tabled 
value  with  the  appropriate  degrees  of  freedom,  then  one  can 
be  at  least  100(l-a)%  sure  that  at  least  one  of  the  predictors 
is  significant  [Ref.  12],  The  significance  level  is  represented 
by  a . 

2 

The  coefficient  of  determination,  R  ,  is  now  defined  as 
R2  =  (  (SYY-RSS)  ,/SYY)  =  (SSR/SYY)  and  is  the  same  as  the  square 

of  the  multiple  correlation  coefficient  between  Y  and  the  X's. 

2 

Most  importantly,  R  is  the  square  of  the  maximum  correlation 
between  Y  and  any  linear  function  of  the  X's,  i.e.,  the  pre¬ 
dicted  values  of  Y.  Table  5  provides  a  summary  of  ANOVA  for 
multiple  regression  where  K  is  the  number  of  predictor  variables 
and  N  is  the  number  of  cases  or  trials. 


Source 


Regression 

(X1  ,^2  ,  *  *  ‘ 


TABLE  5 

ANOVA  FOR  MULTIPLE  REGRESSION 
SS  d.f.  MS 

SSR  K  MSR  =  SSR/k 


F 

MSR/ RMS 


Residuals  (error)  RSS 


N- ( K+l )  RMS  =  RSS/ ( N-K-l ) 


Total  SST  =  SYY  N-l 

=  RSS  +  SSR 
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2 

Sir  'e  the  calculation  of  R  is  dependent  on  the  data  set 

2 

from  wmch  the  model  was  constructed,  R  associated  with  the 

2 

development  sample  will  tend  to  be  larger  than  R  for  the 

2 

population.  This  has  led  to  the  definition  of  adjusted  R  , 

2  2 
Rw  which  is  an  attempt  to  correct  R  to  more  closely  fit  the 

2 

population.  Adjusted  R  is  defined  as 

R2  =  R2  -  ( (P(l-R2) )/(N-P-l)  , 

where  N  is  the  number  of  cases  and  P  is  the  number  of  predictor 

variables  in  the  model  [Ref.  13:  p.  97]. 

Lane  has  defined  the  terms  overfit  and  shrinkage  to  explain 
2 

the  facts  that  R  for  the  equation  development  sample  will  be 

2  2 
larger  than  R  for  the  population  (overfit) ,  while  R  developed 

from  using  a  model  to  predict  over  a  new  sample  will  be  lower 
2 

than  R  for  the  equation  development  sample  (shrinkage)  [Ref. 
14].  His  study  proposes  three  shrinkage  estimation  formulas 
identified  by  their  authors  as  fellows: 

Wherry:  R2  =  1  -  ( 1- R2)  [ ( N- 1 ) / (N-P- 1 )  ] 

Nicholson- Lord:  R2  =  1  -  (1-R2)  [  (N-l)  /  (N-P-l )  ]  [  (N+P-  .)  /Nj 

Darlington:  R2  =  1  -  (1 -R2)  [ (N-1 )/ (N-P-l )  ]  i  (N+i ) /N ! 

[  (N-2)/(N-P-2)  ] 

where  N  is  the  number  of  cases,  P  is  the  number  of  predictor 
2 

variables,  R  is  the  coefficient  of  determination  for  the 
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2  2 

equation  development  sample  and  represents  R  adjusted  for 
the  population.  The  Wherry  formula  can  be  shown  to  be  equiva¬ 
lent  to  the  formula  proposed  earlier  in  Reference  13  for  adjusted 
2 

R  .  The  three  formulas  are,  from  top  to  bottom,  increasingly 

conservative.  The  Wherry  formula  was  developed  to  estimate 
2 

the  value  of  R  to  be  expected  if  weights  from  a  given  sample 
were  applied  to  a  large  number  of  samples  from  the  same  popula¬ 
tion.  Nicholson  and  Lord  noted  that  the  Wherry  formula  actually 
2 

estimates  R  for  the  population;  i.e.,  the  result  expected  if 

population  weights  were  applied  to  the  entire  population  rather 

than  the  result  of  applying  a  sample  equation  to  the  population. 

Both  the  Nicholson-Lord  and  Darlington  equations  are  derived 

from  equations  developed  for  residual  mean  square  and  estimate 
2 

R  resulting  from  the  application  of  a  sample  equation  to  the 
population  [Ref.  14]. 

In  simple  regression,  a  t-statistic  can  be  computed  as 
t  =  3/S  |  ,  and  is  used  to  test  the  null  hypothesis  that  d, 

y  i x 

2 

or  the  slpe,  is  equal  to  zero.  It  is  observed  that  t  =  F 
when  d.f.  =  1.  In  multiple  regression,  the  t-statistic  is 
used  to  test  the  null  hypothesis  that  any  one  regression  weight 
or  coefficient  estimate  is  equal  to  zero.  The  t-statistic  is 
calculated  by  dividing  the  estimate  of  the  coefficient  by  its 
standard  error.  This  statistic  is  then  compared  to  the  tabled 
t  value  with  d.f.  equal  to  N-K-l. 

In  order  to  place  any  validity  on  the  results  obtained 
from  ANOVA  techniques,  various  assumptions  are  made  concerning 


29 


the  data  and  distribution  of  the  error  term.  These  assumptions 
are  discussed  informally  by  Amick  and  Walberg  [Ref.  15]  and 
more  formally  by  Weisberg  [Ref.  11]  and  Younger  [Ref.  12]. 

There  are  basically  five  of  these  assumptions  which  will  be 
presented  next. 

1.  The  model  is  correctly  specified.  Every  variable  (both 
dependent  and  independent)  is  in  its  proper  form. 

Linear  regression  requires  that  the  relationship  of 
the  response  variable  to  the  set  of  predictor  variables 
be  linear.  If  the  relationship  is  actually  curvilinear, 
then  transformations  of  the  data  must  be  used  to 
transform  a  theoretically  nonlinear  model  to  linear 
form. 

2.  The  data  are  representative  of  the  range  over  which 
predictions  are  to  be  made. 

3.  Each  observation  or  trial  can  be  considered  to  be 
independent  of  the  others. 

4.  The  errors  in  the  model  are  assumed  to  be  normally 

and  independently  distributed  with  zero  mean  and  common 
variance . 

5.  The  distribution  of  the  possible  values  of  the  response 
variable  must  be  normal  about  the  fitted  line  for  all 
values  of  the  predictor  variables. 

These  assumptions,  as  they  apply  to  the  data  set  predicting 

canister  life,  will  be  discussed  later  in  this  chapter.  The 

actual  technique  of  checking  for  inadequacies  of  a  model  is 

called  case  analysis  and  is  discussed  next. 


B.  CASE  ANALYSIS 

The  analysis  of  the  correctness  and  suitability  of  a  model 
is  called  case  analysis.  In  order  to  understand  the  technique 
and  to  place  the  author  and  reader  on  common  ground,  it  is 
necessary  to  define  various  terms  used  as  well  as  notation. 
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Weisberg  defines  studentized  residual,  r^,  as  the  residual 
divided  by  an  estimate  of  its  standard  deviation.  The  advan¬ 
tage  of  studentized  residuals  (SRESID)  over  residuals  is  that 
that  standard  deviation  of  the  r^  =  1,  for  all  i.  The  studen¬ 
tized  residual  is  also  used  in  the  calculation  of  a  t-statistic 
to  be  used  in  a  test  for  outliers.  The  t-statistic  is  calcu¬ 
lated  by 


t.  =  r • ( (N-K-l)/(N-K-r2) 1/2 

1  l  l 

where  N  is  the  number  of  trials  and  K  the  number  of  predictor 
variables  in  the  model.  If  the  calculated  statistic  exceeds 
the  table  value  of  critical  values  for  the  outlier's  test, 
given  significance  level  a  with  degrees  of  freedom  N,  then  the 
hypothesis  that  the  case  is  not  an  outlier  is  rejected  with 
confidence  100(l-a)%.  Table  D  of  Reference  11  is  a  table  of 
critical  values  for  the  outlier's  test  [Ref.  11]. 

Outliers  represent  data  points  that  lie  beyond  the  fitted 
equation  and  consequently  will  not  be  predicted  with  much 
accuracy  by  the  model.  They  are  easily  identified  in  case 
analysis  by  their  large  residual  or  studentized  residual  and 
can  be  confirmed  by  the  outlier's  test  previously  discussed. 
Outliers  can  be  the  result  of  randomness  of  the  data,  experi¬ 
mental  error,  or  they  might  represent  new  and  unexplainable 
information.  They  can  not  be  routinely  deleted  from  the 
analysis,  but  must  be  examined  for  the  source  of  their 
aberration . 
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The  studentized  residuals  defin  above  represent  a  scaling 
of  residuals,  so  that  cases  which  are  located  far  from  the 
centroid  of  the  data  space  result  in  increased  residuals,  while 
cases  closer  to  the  centroid  result  in  decreased  residuals. 

Cook's  distance  is  defined  using  studentized  residuals  and  is 
used  as  a  measure  of  the  influence  a  single  point  has  on  the 
development  of  the  model.  The  size  or  magnitude  of  Cook's  dis¬ 
tance  will  be  determined  by  the  magnitude  of  the  studentized 
residual  and  the  distance  of  the  point  from  the  centroid  of  the 
data  space.  A  data  point  that  exerts  a  large  influence  on  a 
model  or  fitted  equation  might  have  a  small  residual  as  it 
essentially  pulls  the  equation  towards  it,  but  can  be  identi¬ 
fied  by  Cook's  distance  due  to  its  position  outside  the  centroid. 
Highly  influential  cases  should  be  examined  in  an  attempt  to 

identify  the  source  or  cause  of  their  uniqueness  and,  if  deleted, 

2 

will  result  in  a  lower  value  of  R  .  However,  more  confidence 
can  be  placed  in  the  model  in  terms  of  its  future  application 
[Ref.  11]. 

The  single  most  important  plot  in  case  analysis  is  the 
plot  of  residuals  with  the  predicted  values  of  the  dependent 
variable,  Y.  Weisberg  recommends  using  studentized  residuals 
in  the  plot  because  each  r^  has  zero  mean  and  variance  of  1  so 
that  95%  of  the  r^  should  fall  within  ±2  and  99%  should  be 
contained  within  tl  [Ref.  11].  The  plot  should  exhibit  no 
characteristic  trends,  i.e.,  should  be  a  random  scatter  of 
points  about  the  axes.  Such  a  plot  would  support  the  assumptions 
of  linearity  of  the  model  and  common  variance  for  the  errors. 
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The  standardized  residual  (ZRES1D)  for  the  1  case  is 
defined  by  Norusis  as  the  residual  divided  by  the  sample 
standard  deviation  of  the  residuals  [Ref.  13].  It  is  useful 
in  examining  the  assumption  of  normality  of  the  errors  of  the 
model.  As  with  the  studentized  residuals,  95%  should  lie 
between  t2  and  99%  between  ±3.  Additionally,  a  normal  proba¬ 
bility  plot  can  be  used  as  a  check  to  see  if  the  residuals 
represent  a  homogeneous  sample  from  a  normal  distribution.  The 
closer  the  plot  resembles  a  straight  line,  the  more  certain 
one  is  that  the  distribution  of  residuals  is  normal.  If  the 
distribution  is  not  normal,  then  a  possible  remedy  is  a 
transformation  of  the  dependent  and/or  response  variable 
[Ref.  11]. 

C.  PROCEDURES  FOR  SUBMODEL  SELECTION 

Once  the  set  of  predictor  variables  has  been  defined  and 
appropriate  transformations  of  the  data  have  been  made,  the 
next  step  is  to  select  that  combination  of  variables  that  pro¬ 
vides  the  best  model.  The  number  of  possible  models  to  examine 
is  obviously  based  upon  the  number  of  predictor  variables 
available.  If  that  number  is  large  enough,  a  factor  analysis 
is  appropriate  to  construct  a  smaller  set  of  factors  that  are 
combinations  of  the  variables  and  essentially  reduce  the  data 
set  to  the  relevant  dimensions  of  that  set. 

With  the  computer  it  is  possible  to  search  through  a  large 
number  of  variables  to  select  subsets  of  those  variables  as 
possible  models.  One  method  of  subset  selection  is  called 
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stepwise  regression  for  which  three  algorithms  exist;  forward, 
backward,  and  stepwise.  The  main  advantages  of  these  algorithms 
are  that  they  are  fast  and  relatively  easy  to  compute.  They 
do  not  provide  a  "best"  model,  but  examine  subsets  following 
a  systematic  technique.  The  three  methods  of  stepwise  regres¬ 
sion  are  discussed  by  Weisberg  [Ref.  11]  and  proceed  as  follows. 

The  forward  selection  algorithm  first  selects  the  indepen¬ 
dent  variable  with  the  highest  sample  correlation  with  the 
response  variable.  The  next  independent  variable  added  is 
chosen  by  the  following  three  equivalent  criteria:  1)  among 
the  remaining  variables,  it  has  the  highest  correlation  with 
the  dependent  variable,  adjusting  for  the  independent  variable 

already  chosen;  2)  its  addition  will  yield  the  largest  increase 
2 

in  R  of  the  remaining  variables;  and  3)  its  t-statistic  is 
largest  of  the  remaining  variables.  Forward  selection  can  be 
stopped  by  the  following  rules:  1)  stop  when  a  limit  on  the 
number  of  independent  variables  is  reached;  2)  stop  when  the 
significance  level  of  the  t-statistic  for  the  remaining  varia¬ 
bles  exceeds  a  pre-set  value;  3)  stop  when  the  addition  of 
another  variable  would  make  the  set  of  predictor  variables 
too  collinear.  Collinearity  is  considered  by  computing  a 

linear  regression  of  the  independent  variable  of  interest  with 

2 

the  variables  already  in  the  model.  R  for  this  regression  is 

2 

computed  and  tolerance  is  defined  as  1-R  .  If  the  tolerance 
is  less  than  some  pre-set  value,  then  the  procedure  stops. 

The  drawback  to  this  method  is  that  a  variable  selected  early 
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in  the  model  development  may  become  non-significant  and  yet 
remain  in  the  final  model. 

The  backward  elimination  algorithm  starts  with  a  model 
constructed  with  all  of  the  independent  variables  and  then 
removes  the  variables  one  at  a  time.  The  variable  removed  is 
the  one  having  the  smallest  t-statistic.  Stopping  rules  are: 

1)  stop  when  the  number  of  independent  variables  specified 
are  remaining  or  2)  stop  if  the  significance  level  of  the 
remaining  variables  exceeds  a  pre-set  value. 

The  stepwise  procedure  is  actually  a  combination  of  the 
forward  and  backward  methods.  It  selects  the  first  variable 
as  the  one  with  the  highest  sample  correlation.  From  then 
on,  it  selects  and  ejects  variables  based  on  entry  and  removal 
criteria  set  by  the  user  and  based  on  the  significance  level 
of  the  t-statistic  for  the  variable. 

D.  BUILDING  THE  MODEL 

The  New  Regression  package  of  the  SPSS  (Statistical  Package 
for  the  Social  Sciences)  Batch  System  was  used  on  the  IBM  3033 
to  perform  a  regression  analysis  of  the  data  in  Appendix  A. 

The  benefits  provided  by  this  package  are:  1)  a  compute 
statement  that  allows  the  computation  of  transformations  of 
the  variables  to  provide  a  better  fit,  2)  the  ability  to  use 
the  forward,  backward,  or  stepwise  algorithms  for  model 
selection,  3)  an  extensive  case  analysis  and  graphis  capability, 
which  enable  a  check  of  assumptions. 
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Examination  of  the  correlation  matrix  of  Table  6  reveals 
that  diameter,  flow,  and  weight  are  all  highly  correlated. 

The  correlation  between  diameter  and  weight  is  expected  because 
as  the  diameter  of  the  test  cell  is  increased,  the  amount  of 
absorbent  that  the  cell  can  hold,  increases.  The  correlation 
of  flow  rate  to  diameter  and  weight  appears  to  be  a  result  of 
the  experimental  manipulation  of  flow  to  represent  various  rates 
for  different  dive  profiles.  Flow  rates  were  kept  relatively 
low  due  to  the  small  size  of  the  test  canister. 


TABLE  6 

CORRELATION  MATRIX 


RH 

TEMP 

WT 

TIME 

LENTH 

RH 

1.000 

-0.032 

-0.293 

0.127 

-0.108 

TEMP 

-0.032 

1.000 

0.093 

0.122 

0.050 

WT 

-0.293 

0.093 

1.000 

0.081 

-0.045 

TIME 

0.127 

0.122 

0.081 

1.000 

0 . 192 

LENTH 

-0 . 108 

0.050 

-0.045 

0.192 

1.000 

Cl  AM 

-0 . 322 

0.113 

0 .974 

0.062 

-0.093 

co2 

0.043 

-0.025 

-0.015 

-0.218 

-0.140 

FLOW 

-0 . 249 

-.091 

0 .854 

-0 .086 

-0.063 

PRESS 

0  .  127 

0 . 157 

-0 . 109 

0.084 

0.016 

HYDRA 

-0.041 

-0.085 

0.060 

0  .  Ill 

-0.007 

DIAM 

co2 

FLOW 

PRESS 

HYDRA 

RH 

-0 . 322 

0.043 

-0 .249 

0.127 

-0.041 

TEMP 

0.113 

-0.025 

0.091 

0 . 157 

-0.085 

WT 

0.974 

-0.015 

0.854 

-0 . 109 

0 .060 

TIME 

0.062 

-0.218 

-0.086 

0.084 

0  .  Ill 

LENTH 

-0.093 

-0 . 140 

-0 .063 

0.016 

-0.007 

DI  AM 

1.000 

0.044 

0 .850 

-0  .133 

0.071 

CO  2 

0.044 

1.000 

-0.025 

-0 .406 

0.023 

FLOW 

0 .850 

-0.025 

1.000 

-0  .  102 

0.041 

PRESS 

-0.133 

-0.406 

-0.102 

1.000 

0 . 101 

HYDRA 

0.071 

0 . 023 

0.041 

0 . 101 

1.000 

Mak 

ing  the 

assumption  t 

t  the  nine 

independent 

variables 

are  the  predictors  most  affecting  the  dependent  variable,  time, 
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a  regression  of  time  with  these  nine  variables  produced  an  equa¬ 
tion  of  the  form 


Y  = 


n 

z 

i=l 


0.X. 
1  1 


with  R  =  0.234.  The  independent  variables  selected  were  C02 , 
LENTH,  RH,  DIAM,  FLOW,  HYDRA,  and  TEMP.  Table  7  lists  the 
ANOVA  table  and  variable  coefficients  generated  by  the  regres¬ 
sion  program.  Examination  of  the  histogram  and  normal  proba¬ 
bility  plot  of  studentized  residuals  reveals  departure  from 
normality.  A  histogram  of  residuals  is  provided  in  Figure  4. 


TABLE  7 


ANALYSIS  OF  VARIANCE  TABLE 


SOURCE 

SS 

OF 

MS 

F 

GRAND  MEAN 

25761545.093  1 

REGRESSION 

8997912.646  7 

12854)6.092 

16.090 

RESIDUAL 

29478974.041  369 

79888.819 

TOTAL 

64238431.780  377 

170393.718 

THE  SIGNIFICANCE  OF  REGRESSION  - 

1 .0000 

(SIGNIFICANCE 

:  AREA  LINDER  CURVE  FROM  0  TO  CONFUTED 

n 

R  SOU ARE 

- 

.234 

COEFFICIENT 

B/SIGMA(B) 

CONFIDENCE 

INTERVAL 

TERM 

B 

SIGMA(B) 

T 

LOVER 

UPPER 

BO 

-414.684 

133.426 

“3.108 

•677.112 

■152.256 

B1 

-67.361 

12.927 

*5.211 

-92.787 

-41 .936 

82 

57.751 

13.465 

4.289 

31.267 

84.234 

83 

1.516 

.325 

4.661 

.877 

2.156 

84 

1 20 . 644 

17.218 

7.007 

86.778 

1 54 . 509 

85 

-.055 

.008 

-6.571 

-.072 

-.039 

86 

11 .945 

4.762 

2.508 

2.579 

21 .31  1 

87 

3.472 

1.542 

2.252 

.439 

6.504 

THE  THEORETICAL  VALUE  FOR  T  AT  THE 

0.05  LEVEL  AND 

369  DF  - 

1  .967 

TIME  =  f (C02 , LENTH, RH , DIAM, FLOW, HYDRA, TEMP) 


o 
-400 
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Figure  4.  Histogram  of  Residuals 

The  plot  of  residuals  versus  predicted  values  in  Figure  5, 
reveals  structure  and  suggests  that  a  transformation  of  data 
is  required. 

Examination  of  Figure  6  reveals  that  the  distribution  of 
TIME  is  positively  skewed  and  that  the  natural  log  transforma¬ 
tion  of  time,  LN  TIME,  more  closely  represents  a  normal 
distribution.  Consequently,  the  next  step  was  to  regress 
LN  TIME  with  all  of  the  variables.  This  results  in  an  equation 
of  the  form: 
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Figure  5.  Plot  of  Residuals  vs.  Predicted 
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or  more  appropriately 


LN  TIME 


LN  FLOW  +  1 2  LN  WT  +  3  3  LN  C02 

LN  PRESS  +  3r  LN  RH  +  6,  LN  TEMP) 
5  6 


Setting  1 .  =  in  K,  where  K  =  expfS^),  the  above  equation  can 
be  transformed  to 


TIME  =  K  ( FLOW)  (WT) 


(co2) 


(press: 


(RH)  (TEMP) 


which  is  a  multiplicative  model.  The  ANOVA  table  of  Table  12 
demonstrates  that  all  coefficients  remain  significant  at  the 
0.0001  level . 

TABLE  12 


ANALYSIS  OF  VARIANCE  TABLE 


SOURCE 

SS 

OF 

MS 

F 

CRANO  MEAN 

4524.367 

1 

REGRESSION 

319.147 

6 

53. 

191 

366.335 

RESIDUAL 

26.281 

181 

145 

TOTAL 

4869.794 

188 

25. 

903 

I ME  SIGNIFICANCE 

OF  REGRESSION  -  1 . 

0000 

(SIGNIFICANCE:  AREA  UNDER 

CURVE  FROM  C 

!  TO  COASTED  F) 

R  SCUARE 

- 

.924 

COEFFICIENT 

B/SIGUA(B) 

CONFIDENCE 

INTERVAL 

TERM 

B 

SICUA(B) 

T 

LCWER 

UPPER 

BO 

6.128 

.713 

8  589 

4.720 

7.535 

B! 

-1  .551 

.037 

-41 .886 

-1.624 

-1 .478 

B2 

1  .62  3 

.048 

33  874 

1  .528 

1.717 

B3 

-1.609 

.069 

-23.160 

'1.746 

-1.472 

B4 

-1.634 

.084 

-19.997 

-1.850 

-1.518 

05 

.060 

.005 

12  290 

.050 

.069 

86 

1  .234 

.173 

7  143 

.893 

1  575 

THE  THEORETICAL 

VALUE  FOR  T  AT  THE  0 

05  LEVEL  AID 

181  DF  - 

1  .974 

LN  TIME  =  f ( LN  FLOW , LN  WT , LN  C02,LN  PRESS, LN  RH , LN  TEMP) 
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The  next  step  was  to  examine  how  well  this  latest  model 
developed  with  half  of  the  data  predicted  over  the  remaining 
half.  Through  various  calculations  as  earlier  defined,  the 
values  in  Table  11  were  obtained. 

TABLE  11 

STATISTICS  FOR  CROSS-VALIDATION 

Development  Sample  Cross-Validation  Sample 

Variance  of  Y  =  1.847  Variance  of  Y  =  1.52 

Variance  of  predicted  Variance  of  predicted 

values  =  1.699  values  =  1.401 

P2  =  0.9199  R2  =  0.8464 

cv 

These  results  indicate  that  although  the  model  does  not  predict 

as  well  over  a  new  set  of  data  as  is  expected,  the  obtained 
2 

R  of  0.852  supports  the  contention  of  a  strong  relationship 
between  the  predictor  variables  and  the  dependent  variable. 

An  additional  model  was  developed  using  the  same  half  of 
the  data  as  before,  but  the  model  was  restricted  to  having  only 
natural  log  transformations  of  the  nine  independent  variables. 
The  New  Regression  package  using  the  stepwise  algorithm  selected 
a  model  of  the  form 

LN  TIME  =  f ( LN  FLOW , LN  WT , LN  C02,LN  PRESS, LN  RH , LN  TEMP) 
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PREDICTED  VALUES 


Figure  12. 


Plot  of  Residuals  vs.  Predicted 


Figure  13.  Plot  of  LN  TIKE  vs.  Predicted 
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Returning  to  the  model  developed  during  cross-validation, 


the  normal  probability  plot  and  residual  plot  of  Figure  11 
support  the  assumption  of  normality  of  errors.  The  plot  of 
residuals  versus  predicted  values  in  Figure  12  shows  little, 
if  any,  structure. 


Figure  11.  Histogram  of  Residuals 

The  plot  of  actual  values  versus  predicted  in  Figure  13  sup¬ 
ports  the  assumption  of  common  variance  of  the  residuals. 
Examination  of  the  ZRESID  or  standardized  residuals  reveals 
that  99%  are  within  -3  and,  therefore,  support  the  contention 
that  residuals  or  errors  are  normally  distributed. 
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coefficient  is  small.  The  column  labeled  BETA,  represents 
the  standardized  regression  coefficients  derived  by  multiplying 
the  variable  coefficient  by  a  ratio  of  the  standard  deviation 
of  the  independent  variable  to  the  standard  deviation  of  the 
dependent  variable  [Ref.  13].  The  actual  coefficients  do  not 
represent  the  importance  of  the  variable  in  the  regression 
equation,  since  the  independent  variables  are  not  expressed 
in  the  same  units.  The  BETA  weights  when  standardized  are 
dimensionless,  however,  even  then,  they  do  not  represent 
importance  since  the  predictors  are  intercorrelated . 


TABLE  10 


ANALYSIS  OF  VARIANCE  TABLE 


SOURCE 

SS 

DF 

MS 

F 

GRAND  MEAN 

A524.367 

1 

REGRESSION 

337.780 

6 

52.963 

346.731 

RESIDUAL 

27.648 

181 

.153 

total 

4869 . 794 

188 

25.903 

THE  SIGNIFICANCE  OF  REGRESSION  -  I . 

0000 

(SIGNIFICANCE 

:  AREA  UNDER 

CURVE  FROM  C 

l  TO  COMMUTED  F) 

R  SOU  ARE 

.920 

COEFFICIENT 

B/SIGMA(B) 

CONFIDENCE 

INTERVAL 

TERM 

8 

SIGMA(B) 

T 

LOWER 

UPPER 

BO 

10.317 

259 

39  870 

9.806 

10.828 

B1 

-.680 

.030 

-22.340 

-.740 

-.620 

B2 

-1 .538 

.038 

-40.552 

-1 .613 

-1 .463 

B3 

1.625 

.049 

33.064 

1.528 

1  .722 

BA 

.058 

.005 

1  1  687 

.048 

.068 

85 

-.276 

.034 

-8.053 

-.344 

-.209 

86 

.022 

.003 

6.750 

.016 

.029 

THE  THEORETICAL  VALUE  FOR  T  AT  THE  0. 

05  LEVEL  AM) 

181  DF  - 

1.974 

LN  TIME  =  f(C02,LN  FLOW , LN  WT ,  LN  RH , LN  PRESS, TEMP) 


49 


model  can  be  tested  by  comparing  standard  errors  or  by  using 

2 

the  standard  errors  to  calculate  R  for  the  equation  develop- 

2  2  2 
ment  sample  (R0)  and  R  ,  for  the  cross-validation  sample  (R  ). 

s  cv 

2  2 

The  difference  of  Rg  -  Rc^  was  defined  earlier  as  shrinkage  by 
Lane  [Ref .  14 ]  . 

Based  on  the  above  discussion,  the  data  set  was  split  in 
half  using  a  random  number  generator  and  a  regression  of  LN  TIME 
with  all  variables  plus  natural  log,  second  order,  and  inverse 
terms  examined.  This  resulted  in  an  equation  of  the  form 

LN  TIME  =  f ( CO 2 , LN  FLOW,  LN  WT ,  LN  RH ,  LN  PRESS,  TEMP)  +  constant 

with  R“  =  0.91996.  This  model  can  be  represented  as 

LN  TIME  =  3q  +  £1C02  +  32  LN  FLOW  +  3  3  LN  WT  +  g  LN  RH 
+  P5  LN PRESS  +  36  TEMP 


or 

TIME  =  exp(:fl  +  31C02  +  82  LN  FLOW  +  33  LN  WT  +  LN  RH 

+  3 5  LN PRESS  +  i6  TEMP) 

and  is  exponential  in  form.  Table  10  lists  the  ANOVA  table 
and  variable  coefficients  generated  by  the  regression  program. 

It  is  observed  from  the  table  that  all  coefficients  are  signifi¬ 
cant  at  the  a  =  0.0001  level  and  the  standard  error  of  each 
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2 

to  1.0,  the  better  the  fit.  However,  R  is  th  2  square  of  the 
sample  correlation  coefficient,  R,  which  measures  the  strength 
of  the  relationship  between  the  predictors  and  response.  The 
range  of  R  is  from  minus  1.0  to  plus  1.0  with  values  between 
minus  1.0  and  zero  representing  inverse  relationships  and  values 
between  zero  and  plus  1.0,  direct  relationships.  Younger 
proposes  the  following  scale  to  represent  the  strength  of  the 
relationship  indicated  by  R  [Ref.  12]. 
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For  example,  from  the  graph,  an  R  of  0.5625  would  translate 

to  an  R  of  0.75  indicating  a  moderately  strong  relationship. 

2 

The  second  question  concerns  the  data  dependency  of  R  . 

2 

Adjusted  R  can  be  calculated  as  previously  discussed  to 

2 

predict  a  value  of  R  for  the  population  or  the  model  can  be 
tested  by  examining  how  well  it  predicts  with  a  new  set  of 
data.  Since  a  regression  model  is  developed  to  fit  a  data  set 
as  closely  as  possible,  testing  how  well  a  model  predicts  over 
this  same  data  set  is  almost  certain  to  overestimate  performance 
[Ref.  17].  This  idea  has  led  to  the  notion  of  simple  cross- 
validation,  where  a  data  set  can  be  split  in  half  and  the  model 
then  developed  over  one  half  and  tested  on  the  other  half.  The 
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Figure  10.  Plot  of  Residuals  vs.  Predicted 

Using  the  stepwise  procedures  of  New  Regression,  it  has 
been  possible  to  adjust  the  values  of  tolerance  and  signifi¬ 
cance  level  for  the  t  tests  so  that  the  resulting  model  included 
only  those  predictor  variables  that  are  significant  and  not 
highly  correlated.  Two  questions  remain,  however,  and  they  are: 
1)  how  Irrge  does  R  need  to  be  before  a  model  represents  a 
meaningful  relationship  between  independent  variables  and  the 
dependent  variable?  and  2)  how  well  will  a  model  predict  with 
a  new  set  of  data? 

The  first  question  can  be  resolved  by  returning  to  the 

2  2 
definition  of  R  .  Based  on  its  definition,  the  closer  R  is 
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j 


Figure  9 .  Histogram  of  Residuals  jj 

The  residual  plot  (Figure  10)  shows  little  structure,  suggest-  ! 

ing  the  current  model  is  adequate.  The  model  is  exponential  J 

and  can  be  expressed  as 

„  N  - 

Y  =  exp(3n  +  l  6-X.)  .  ij 

i  =  l 

The  set  of  predictor  variables  chosen  in  this  final  model  were 
selected  from  a  set  composed  of  the  original  nine  independent  f 

variables,  plus  square  terms,  cross  terms,  and  both  inverse 
and  natural  log  transformations  of  the  independent  variables. 

i 
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2 

with  R  =  0.880.  Table  9  lists  the  ANOVA  table  and  variable 
coefficients . 


TABLE  9 


ANALYSIS  OF  VARIANCE  TABLE 


SOURCE 

SS 

OF 

MS 

GRAND  LCAN 

8928.089 

1 

REGRESSION 

554 . 854 

6 

92.476 

RESIDUAL 

75.748 

370 

.205 

TOTAL 

9558.691 

377 

25.355 

THE  SIGNIFICANCE  OF  REGRESSION  -  1.0000 

(SIGNIFICANCE:  AREA  UNDER  CURVE  FROM  0  TO  COMPUTED  F) 
R  SQUARE  -  .880 


F 

451.708 


COEFFICIENT 

B/SICUA(B) 

CONFIDENCE 

INTERVAL 

TERM 

B 

SIGMA(B) 

T 

LOVER 

UPPER 

BO 

10  155 

1 .65E-01 

61 .371 

9.830 

10.481 

B1 

“1.540 

3.13E-02 

“49.225 

-1 .602 

-1.478 

B2 

1.583 

3.77E-02 

42.043 

1.509 

1  .658 

B3 

-.123 

4.65E-03 

"26.516 

-.132 

-.114 

B4 

.007 

5.34E-04 

12.616 

.006 

.008 

B5 

.000 

2.26E-Q5 

7.117 

.000 

.000 

B6 

-.016 

3.36E-03 

“4.627 

-.022 

-.009 

THE  THEORETICAL  VALUE  FOR  T  AT  THE  0.05  LEVEL  AND  370  DF  -  1.967 


LN  TIME  =  f(LN  FLOW , LN  WT , (C02 ) 2 , RH , TEMP2 , PRESS ) 

All  variables  are  significant  at  the  0.001  level  as  is  the 
developed  equation.  By  calculating  the  t-statistic  for  an 
outliers  test  as  defined  earlier  it  is  observed  that  for  case 
126  with  the  largest  studentized  residual  of  3.062,  the  t- 
statistic  is  2.9924.  From  the  table  of  critical  values  for 
the  outliers  test,  t  =  3.87  for  ot  =  0.05  and  d.f.  of  369.  This 
implies  that  it  is  not  95%  certain  that  case  126  is  an  outlier 
and,  therefore,  no  cases  will  be  deleted.  Examination  of  the 
histogram  of  Figure  9  supports  the  assumption  of  normality. 
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The  current  model  is  then  an  exponential  model  of  the  form 


n 

Y  =  exp{3n  +  I  i .X  ) 
i=l 

and  the  relationship  between  TIME  and  the  predictor  variables 
is  curvilinear.  The  equation  is  inherently  linear  as  it  is 
possible  to  use  the  naturla  log  transformation  to  yield  a 
linear  form 


~  n  a 

£n  Y  =  Sn  +  J  3 • X .  . 

0  iii  1  1 

Examination  of  the  case  statistics  reveals  that  case  number  292 
has  a  large  value  of  Cook's  distance  of  0.2355  with  the  next 
largest  value  equal  to  0.04295.  Since  its  studentized  residual 
is  only  moderately  large,  its  influence  is  most  likely  large 
and  it  is  deleted  for  the  purposes  of  further  analysis.  Case 
292  is  the  case  with  the  highest  recorded  flow  rate  and  might 
possibly  represent  failure  of  the  model  to  predict  over  high 
rates  of  flow. 

Next,  a  regression  of  the  natural  log  of  time  with  all  of 
the  predictor  variables,  plus  second  order  terms  and  natural 
log  terms  was  run.  In  order  to  enable  the  calculation  of  a 
natural  log  of  RH,  which  at  times  is  equal  to  zero,  0.0001 
was  added  to  all  values  of  RH .  Using  the  stepwise  algorithm, 
the  regression  yielded  an  equation  of  the  form 

LN  TIME  =  f(LN  FLOW, LN  WT , <C02 ) 2 , RH , TEMP2 , PRESS )  +  constant 
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RESIDUALS 


0 


2 


Figure  7.  Histogram  of  Residuals 


Y  = 


n 

+  y 

i=l 


3iXi 


where  the  variables  selected  are  HYDRA,  LENTH,  WT ,  C02 ,  RH, 

2 

and  FLOW.  R  has  reduced  to  0.218  as  depicted  in  Table  8 
which  is  the  ANOVA  table  with  variable  coefficients.  Both  the 


histogram  of  Figure  7  and  the  normal  probability  plot  have 


improved  to  support  the  assumption  of  normality  of  the  errors 
and  most  of  the  structure  of  the  residuals  has  disappeared 
(Figure  8 ) . 


TABLE  8 


ANALYSIS  OF  VARIANCE  TABLE 


SOURCE 

SS 

DF 

MS 

F 

GRAND  MEAN 

8928.089 

1 

REGRESSION 

137.544 

6 

22.924 

17.203 

RESIDUAL 

493.058 

370 

1  333 

TOTAL 

9558.691 

377 

25.355 

THE  SIGNIFICANCE  OF  REGRESSION  -  1. 

0000 

(SIGNIFICANCE 

:  AREA  UNDER 

CURVE  FROM  0 

TO  COAff’UTED  F) 

R  SOU ARE 

- 

.218 

COEFFICIENT 

B/SIGMA(B) 

CONFIDENCE 

INTERVAL 

TERM 

B 

SIGUA(B) 

T 

LOWER 

UPPER 

BO 

3.613 

3.29E-01 

10.985 

2.966 

4.260 

B1 

.068 

1  .93E-02 

3.528 

.030 

106 

B2 

.  161 

5.47E-02 

2.943 

.053 

.268 

B3 

.012 

1 .82E-03 

6.704 

.009 

.016 

B4 

-.286 

5.24E-02 

-5.463 

-.389 

"  .  183 

B5 

.004 

1.31 E~03 

2.836 

.001 

.006 

B6 

.000 

3.46E-05 

“6.034 

.000 

.000 

THE  THEORETICAL  VALUE  FOR  T  AT  THE  0. 

05  LEVEL  AND 

370  DF  - 

1  .967 

LN  TIME  =  f (HYDRA, LENTH, WT,C02,RH, FLOW) 
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Figure  6.  Frequency  Distributions 


A  standard  error  comparison  shows  that  this  multiplicative 

model  has  standard  error  of  0.38105  versus  0.39085  for  the 

2 

exponential  model.  R  =  0.924  and  the  residual  plot  of  Figure 
14  is  approximately  normal,  while  the  plot  of  residuals  versus 
predicted  values  is  without  structure,  Figure  15. 


-'0  -0  5  0  0.5  1.0 

RESIDUALS 


Figure  14.  Histogram  of  Residuals 


As  with  the  exponential  model,  the  plot  of  predicted  values 
versus  actual  values  generally  supports  the  contention  of 
common  variance  of  the  errors.  Figure  16. 

A  final  approach  to  modelling  was  made  in  an  attempt  to 
provide  the  capability  to  make  predictions  beyond  the  range 


NATURAL  LOG  TIME 


of  the  current  variables  by  using  msformations  of  the  pre¬ 
dictor  variables  to  redefine  the  data  space.  The  result  of 
the  dimensional  analysis  discussed  earlier  was  to  identify 
efficiency  as  a  function  of  five  dimensionless  groups.  This 
expression  can  be  rewritten  to  demonstrate  that  efficiency  is 
a  function  of  eight  variables:  n  =  f ( p , V, e , p , T , H , L , D) .  As 
defined  earlier,  the  theoretical  breakthrough  time,  t_„ ,  of 
a  canister  can  be  calculated  by  knowing  the  weight  of  absorbent, 
the  flow  rate,  CO^  concentration ,  and  density  of  CC>2 .  Now, 
canister  efficiency  is  defined  as  n  =  and  can  be  calcu- 

lated  for  each  trial  or  case  in  the  data  matrix. 

This  approach  is  to  build  a  model  from  half  of  the  original 
data  set  to  predict  efficiency,  yielding  tB  from  the  product 
of  n  and  tTH .  From  the  data  set,  it  is  possible  to  express 
efficiency  as  a  function  of  nine  predictor  variables: 
n  =  f(RH, TEMP, WT,LENTH,DIAM, C02, FLOW, PRESS, HYDRA) .  Since 
efficiency  is  not  affected  by  weight  of  the  absorbent,  WT 
may  be  removed  from  consideration.  of  the  remaining  variables, 
only  FLOW,  LENTH ,  and  DIAM  are  restrictive  in  the  sense  that 
they  limit  application  of  any  model  to  small  test  size  canisters 
However,  by  reexpressing  LENTH  and  DIAM  as  a  ratio  and  replacing 
FLOW  with  mean  linear  gas  velocity  defined  as  V  =  FLOW/A  , 
where  Acs  is  cross-sectional  area  of  the  canister,  the  predictor 
set  ranges  over  the  full  extent  of  practical  applications. 
Efficiency  can  now  be  expressed  as  a  function  of  seven  variables 
as,  =  f  { RH  ,  TEMP  ,  L/D ,  CO.-,  ,  V,  PRESS  ,  HYDRA)  .  Examination  of  the 


correlation  matrix  of  Table  13  reveals  that  several  variables 


are  negatively  correlated,  yet  none  are  very  highly  correlated 
with  each  other. 


TABLE  13 

CORRELATION  MATRIX 


RH 

TEMP 

LOD 

co2 

RH 

1.000 

-0.091 

-0.061 

0.087 

TEMP 

-0.091 

1.000 

0.035 

-0.032 

LOD 

-0.061 

0.035 

1.000 

-0 . 162 

CO  2 

0.087 

-0.032 

-0 . 162 

1.000 

VBAR 

0.018 

0 .118 

0.030 

-0 . 113 

PRESS 

0 .217 

0 . 144 

0.030 

-0 . 417 

HYDRA 

-0.038 

-0.071 

-0.012 

0.021 

EFF 

0.440 

0 . 191 

0 . 100 

-0.275 

VBAR 

PRESS 

HYDRA 

EFF 

RH 

0.018 

0.217 

-0.038 

0.440 

TEMP 

0.118 

0 . 144 

-0.071 

0.191 

LOD 

0.030 

0.030 

-0.012 

0.100 

co2 

-0 . 113 

-0.417 

0.021 

-0.275 

VBAR 

1.000 

-0.024 

-0.143 

-0 . 362 

PRESS 

-0.024 

1.000 

0.097 

0.183 

HYDRA 

-0.143 

0.097 

1.000 

0.005 

EFF 

-0 . 362 

0.183 

0.005 

1.000 

After  calculating  efficiency  as  t_/t_„r  it  was  discovered 

bin 

that  in  17  of  the  188  cases,  efficiency  was  greater  than  one. 
These  values  were  all  changed  to  one  as  an  efficiency  greater 
than  one  has  no  meaning  and  in  this  situation  is  most  likely 
the  result  of  error  in  measurement.  A  regression  analysis 
was  run  with  efficiency  as  the  dependent  var iable  and  resulted 
in  an  equation  of  the  form 

5 


57 


where  the  predictor  variables  are,  in  order,  RH,  V,  CO^ ,  TEMP, 

and  PRESS.  The  variable  coefficients  are  0.00346,  -0.64302, 

-0.11180,  0.00983,  -0.00740  with  a  constant  term  of  -0.10873. 

2 

R  =  0.569  and  all  variables  are  significant  at  the  u  =  0.01 
level.  Both  L/D  and  HYDRA  were  not  selected  as  they  were  not 
significant  at  the  0.01  level.  The  residual  plot  of  Figure  17 
shows  some  structure  suggesting  that  a  transformation  of  data 
might  provide  a  better  fit. 


Figure  17.  Plot  of  Residuals  vs.  Predicted 


The  .  togram  of  residuals  in  Figure  18  indicates  a  slight 
d-  pariure  from  normality.  Figure  19  is  a  plot  of  efficiency 


versus  predicted  efficiency  and  depicts  the  variability  of 
predictions  over  the  range  of  the  dependent  variable. 

The  next  step  was  to  examine  various  transformations  of 
the  independent  variables  in  an  attempt  to  remove  the  structure 
from  the  residuals.  Various  trials  using  power  transformations 
and  cross  terms  resulted  in  a  model  for  efficiency  of  the  form 

/V  6  A 

V  =  80  +  8iXi 

where  the  independent  variables  are  (RH  x  TEMP)  ,  (LN  V)  , 

(RH  xC02)  ,  (V  x  PRESS),  (RH  xV),  and  (TEMP  x  LOD)  .  The  variable 

coefficients  are  0.0000816,  -0.12947,  -0.00141,  -0.04283, 

-0.00297,  and  0.000305  with  a  constant  term  of  0.01782.  R2  =  0.696 

and  all  variables  are  significant  at  the  a  =  0.01  level.  The 

residual  plot  of  Figure  20  shows  some  improvement  from  the 

previous  model  and  the  histogram  of  residuals  in  Figure  21 

is  more  normal  than  before.  Figure  22  depicts  the  variability 

of  predictions  over  the  range  of  the  dependent  variable. 

Both  models  were  developed  using  half  of  the  data  set  and 

cross-validation  was  performed  as  before  using  the  second 

model,  producing  the  results  of  Table  14.  Shrinkage  is  equal 

2 

to  0.156,  however,  R  is  equal  to  0.533  indicating  a  moderately 
strong  relationship  based  on  the  scale  provided  earlier. 

E.  INTERPRETING  THE  RESULTS 

Utilizing  the  techniques  of  multiple  regression,  two 
separate  models  have  been  developed,  one  exponential  and  the 
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Figure  22.  Plot  of  Efficiency  vs.  Predicted 


TABLE  14 

STATISTICS  FOR  CROSS-VALIDATION 


Development  Sample 
Variance  of  Y  =  0.08655 


Cross-Validation  Sample 
Variance  of  Y  =  0.07928 


Variance  of  predicted 
values  =  0.05989 


Variance 

values 


of  predicted 
=  0.04376 


R2  =  0.689 


R 


cv 


0 . 533 
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other  multiplicative,  to  predict  the  canister  life,  tg ,  of 
an  axial  flow  canister  containing  High  Performance  Sodasorb. 

Both  models  have  selected  the  same  set  of  independent  or 
predictor  variables  to  estimate  tg .  The  variables  chosen  are 
percent  relative  humidity  of  the  gas  stream  (RH) ,  temperature 
in  degrees  Fahrenheit  of  the  inlet  gas  (TEMP) ,  weight  in  grams 
of  the  chemical  absorbent  (WT) ,  percent  by  volume,  surface 
level  equivalent,  of  CC^  in  the  inlet  gas  (CC^)  ,  absolute  rate 
of  flow  of  the  inlet  gas  in  cubic  centimeters  per  minute  (FLOW) , 
and  absolute  pressure  of  the  environment  expressed  in  atmos¬ 
pheres  absolute  (PRESS) .  The  variables  not  included  are 
canister  length  in  inches  (LENTH) ,  canister  diameter  in  centi¬ 
meters  (DIAM) ,  and  moisture  level  of  the  chemical  absorbent 
( HYRDA) .  Earlier  examination  of  the  correlation  matrix  revealed 
that  both  length  and  diameter  are  highly  correlated  with  both 
weight  and  flow.  Chemical  hydration  level  is  usually  fixed  by 
the  manufacturer  and  the  results  of  this  analysis  indicate  it 
has  small  significance  in  predicting  tg . 

Both  models  have  satisfied  the  assumptions  inherent  in 
regression  analysis  as  demonstrated  by  the  case  analysis. 

All  coefficients  are  highly  significant  and  approximately 
92%  of  the  variability  of  the  data  is  explained  with  each 
model.  However,  the  worth  or  value  of  these  models  lies  in 
their  ability  to  predict  canister  life,  tg. 

As  stated  earlier,  regression  analysis  provides  a  basis 
for  making  predictions  within  the  ranges  of  the  values  of  the 
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independent  or  predictor  variables.  This  restricts  the  use 
of  the  models  to  the  ranges  provided  earlier  in  Table  4. 

If  a  canister  of  known  dimensions  is  filled  with  HP  Sodasorb 


and  subjected  to  a  gas  stream  flow  rate  with  known  CO^  concen¬ 
tration  and  relative  humidity  at  a  particular  depth  and  tempera¬ 
ture,  then  the  life  of  that  canister  can  be  predicted. 

Predictions  made  outside  the  range  of  the  data  are  suspect 
and  not  reliable.  This  is  demonstrated  by  the  following  example 
calculations.  The  MK-12  surface  supported  dive  system  utilizes 
a  canister  14.75  inches  in  length  with  a  6  inch  inside  diameter. 
For  a  dive  depth  of  390  feet  (12.82  ATA) ,  a  flow  rate  of  6.0 
absolute  cuvic  feet  per  minute  (acfm)  is  required  of  which  0.6 
is  utilized  by  the  diver  leaving  5.4  acfm  to  enter  the  canister. 
A  CC>2  level  of  0.58%  at  the  surface  or  0.04524%  at  depth  is 
assumed  to  enter  the  canister.  The  weight  of  absorbent  inthe 
canister  is  12  lbs.  or  5448  grams,  while  the  temperature  is 
60°  F  and  the  relative  humidity  is  assumed  to  be  100%.  Using 
the  exponential  model,  tQ  is  predicted  as  891  minutes  or  14.85 
hours.  Theoretical  canister  breakthrough  is  calculated  to  be 
1355  minutes  or  22.6  hours.  In  actual  practice,  the  MK-12 
dive  system  has  been  utilized  to  slightly  more  than  10  hours 
as  reported  in  Reference  1. 

Since  the  models  generated  are  restricted  in  use  to  the 
range  of  the  values  of  the  independent  variables  provided  in 
the  data  set,  this  limits  their  application  to  small  test  size 
canisters  that  are  too  small  for  diving  applications.  In  order 
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to  successfully  apply  this  analytical  approach,  additional 
tests  would  have  to  be  made  with  larger  canisters,  allowing 
for  greater  absorbent  weights  and  higher  gas  stream  flow 
rates . 

The  final  approach  of  modelling  efficiency  as  a  function 

2 

of  the  data  resulted  in  two  models  with  R  values  of  0 . 569 
and  0.696  respectively.  Most  importantly,  this  approach  allows 
t  to  be  calculated  from  the  product  of  n  and  tmu  over  a  much 
larger  data  space  than  before.  Sample  calculations  were  per¬ 
formed  for  the  MK-12  dive  profile  as  previously  discussed  for 
both  models.  The  variable  values  were  as  follows:  RH  =  100, 

V  =  0.46,  C02  =  0.04524,  TEMP  =  60,  PRESS  =  12.82,  L/D  =  0.9678. 
The  first  model  predicted  an  efficiency  of  0.431  yielding  a 
canister  breakthrough  time  of  584  minutes  or  9.7  hours.  The 
second  model  predicted  an  efficiency  of  0.230  yielding  a 
canister  breakthrough  time  of  311.7  minutes  or  5.2  hours. 

Recall  that  canister  breakthrough  of  approximately  10  hours 
has  been  observed  in  actual  practice  for  this  MK-12  dive 
profile.  The  ranges  of  the  variables  used  in  the  model  develop¬ 
ment  are  listed  below,  defining  the  space  over  which  application 
of  these  models  is  restricted. 


MEAN 

VAR 

MIN 

MAX 

PH 

61 . 7 

2215 

0 

100 

V 

0.2067 

0.0446 

0  .0219 

1.465 

C°2 

1.1445 

1.3295 

0.031 

4 .35 

TEMP 

66.9947 

84 .6363 

35 

80 

PRESS 

4 .1277 

48.5291 

1 

32 

L/  D 

2.6606 

2.9050 

0 .2514 

16.36 
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Ill . 


CONCLUSIONS 


The  CO 2  absorption  process  in  a  canister  is  a  chemical 
process  requiring  both  water  and  heat  in  order  to  perform. 

Water  is  provided  in  the  chemical  absorbent  and  in  the  moisture 
of  the  gas  stream.  The  process  is  highly  influenced  by  the 
rate  of  flow  through  the  canister,  the  concentration  of  CO2 
in  the  inlet  gas,  the  moisture  content  of  the  inlet  gas,  the 
operating  temperature  and,  to  some  extent,  the  operating 
pressure.  The  effect  of  pressure  is  to  change  the  density 
and  viscosity  of  the  inlet  gas  and  more  importantly  to  affect 
the  actual  water  content  of  the  gas  stream  [Ref.  1] . 

The  models  generated  to  predict  canister  breakthrough  time 
appear  to  meet  the  assumptions  inherent  to  multiple  regression 
analysis  and  explain  approximately  92%  of  the  variability  of 
the  data.  Their  application  is  limited  however  to  axial  flow 
canisters  with  steady  flow  and,  most  importantly,  to  small 
canister  dimensions.  In  order  to  utilize  this  approach  of 
modelling  canister  breakthrough  as  a  function  of  the  available 
data  and  be  able  to  apply  the  model  to  a  wide  range  of  canister 
sizes,  more  experimentation  would  be  required. 

Utilizing  the  definition  of  theoretical  canister  life  and 
canister  efficiency  provided  in  Reference  1,  canister  efficiency 
«as  calculated  for  each  of  the  188  trials  representing  half 
of  the  data.  Canister  efficiencv  was  then  modelled  as  a 


function  of  the  eight  independent  variables.  By  reexpressing 
gas  volumetric  flow  as  gas  mean  velocity  and  reexpressing  length 
and  diameter  as  a  ratio,  the  ranges  of  the  independent  varia¬ 
bles  in  the  development  sample  include  parameter  values  for 
most  dive  profiles.  The  models  for  efficiency  in  Chapter  II 
explain  approximately  60%  and  70%  of  the  variability  of  the 
data,  respectively.  Simple  cross-validation  has  shown  that  the 
models,  while  showing  the  expected  shrinkage  do  exhibit  good 
stability  across  different  data  sets. 

These  models  should  serve  as  an  aid  to  the  design  process 
outlined  in  NCSC  TECHMAN  4110.  As  models,  they  represent  an 
attempt  to  quantify  the  relationship  of  canister  performance 
with  variuous  environmental  and  geometric  parameters. 
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appendix  b 


DATA  STATISTICS 


*ean;  56 . 76392573 

variance:  2301.202099 

sri.,  dev.:  47.97034635 

coefp,  of  variation:  0.3450938319 

lower  CIUAPTIlE:  0 

UPPER  CIUAPTILE  ;  1Q0 

median:  100 

TP  x  M  E  A  N  :  75 

midmean:  56.76392573 
range:  ioo 
Mir- range:  50 

ME  AN  ABSOLUTE  E  V  I  A  T  I  O  N  •  43.23607427 

INTERQUARTILE  '  A N G E :  100 

coeff,  of  skewness;  ”0.2722239663 
COEFF,  OF  KURTOSIS:  “1.353103560 


STATISTICS  c  2 

i>6. 47214854 
variance;  91.77648287 
std,  dev.;  9,530004325 
coeff ,  OF  variation;  0.1441205753 

LOWER  CIUAPTILE;  70 
UR  PER  GUAR TILEJ  70 
MEDIAN*  70 
T  R  I  M  E  A  M :  70 

midmean;  70 
RANGE;  4  5 

Ml  I' RANGE  :  57.5 

MEAN  ABSOLUTE  DEVIATION*  3.899204244 
INTERQUARTILE  RANGE;  Q 
coeff.  of  skewness;  ”2.264855315 
coeff  ,  OF  kurtosis;  4.098664453 


STATISTICS  C3 

mean;  19 . 99413793 

variance;  4049.837996 

std.  dev,;  63.63833747 

coeff,  of  variation;  3.132349778 

lowef  quartxle;  3,99 

ijpfer  ciuartile;  4.3 

me  i<  I  an  ;  4 

tb  I  mean;  4.1975 

midmean;  4,092277228 

range ;  381.51 

midrange;  192.805 

MEAN  ABSOLUTE  DEVIATION;  16.10692308 
interquartile  range;  0.81 
coeff,  or  skewness;  4.605224212 
coeff,  OF  kurtosis:  20.84155735 


STATISTICS  C4 

mean;  261 .4058355 

variance;  102332.1454 

STD.  DEV.;  319.8939597 

coeff,  of  variation;  1.223744524 

lower  quartile;  53,5 

upper  quartile;  333.3 

median;  150.8 

T  R I M  E  A  M ;  172.1 

midmean;  160.2650794 

range;  1966.6 

midrange;  986.4 

MEAN  ABSOLUTE  DEVIATION;  199,1527351 
interquartile  range;  279.3 
coeff,  of  skewness  ;  2.267689597 

coeff,  of  kurtosis;  5.635673376 


mean;  2.679376653 

variance;  1.233382645 

std.  dev.;  1.U0577613 

coeff,  of  variation;  0.4144910401 

lower  quartile;  2.514 

upper  quartile;  2.731 

median*  2.639 

trimean:  2.63075 

midmean;  2.627466667 

range;  14.953 

midrange;  8.5235 

MEAN  ABSOLUTE  DEVIATION*  0.2911564987 
interquartile  range;  0.217 
coeff.  of  skewness;  9.754890674 
coeff,  of  kurtosis;  110.1184256 


mean;  1.489450929 

variance;  2.809112562 

STD,  DEV,;  1,67604074 

coeff,  of  variation;  1,125274226 

lower  quartile;  0.978 

upper  quartile;  0.978 

median;  0.978 

trimean;  0.979 

midmean;  0.978 

range;  8.471 

midrange;  5,2105 

MEAN  ABSOLUTE  DEVIATION*,  0.5114827586 
INTERQUARTILE  RANGE;  0 

coeff,  of  skewness;  3.78741941 
coeff,  of  kurtosis;  13.6946633 


weftMj  1.149448276 

variance:  1.319485716 

STD.  DEV.:  1.148683694 

coetf,  of  variation;  0.9993391771 

LOWER  QuAPTILE •  0.49 

UPPER  OUARTIUEJ  1.08 

MEDIAN ;  1.05 

trimean;  0.9175 
midmeah ;  1.000019512 
RANGE  ;  4.319 
midrange;  2.1905 

MEAN  ABSOLUTE  DEVIATION;  0.6293050390 
interquar t ile  range;  0.59 
COEFF,  OF  SKEWNESS;  1.916488355 
coeff,  of  kurtosis*  2.948357655 


«ean;  1039.387416 

variance;  10965495.67 

std.  dev.;  3311.418931 

coeff,  of  variation;  3.184401435 

LOWER  QUARTILE;  132.905 

UPPER  QUARTILE;  437.658 

median;  269.806 

trimean;  277.54375 

midmean;  278.9265132 

range  ;  31161 .903 

midrange;  15604.7885 

mean  absolute  deviation;  892. 8239072 
interquartile  range;  304.753 
coeff,  OF  skewness;  5.535591624 
COEFF,  OF  kurtosis;  34.31907269 


STAT 1ST  ICS  C  9 

mean;  4.201591512 

variance;  54.6241464 

STD.  DEV,;  7.390815002 

coeff,  of  variation;  1.759051298 

LOWER  GUAR tile;  1 

U  F  F  E  R  GUAR TILE;  2 

median;  i 

TR I  MEAN ;  1.25 

midmeah;  1.074204947 

range;  31 

midrange;  16.5 

mean  absolute  deviation;  3,201591512 

Iff  TER  QUARTILE  RANGE  J  1 

coeff,  of  skewness;  2.768613901 
coeff,  of  kurtosis;  7.042060544 


STATISTICS  CIO 
wean;  13.38594164 
variance;  9.518977369 
std.  dev.j  3.085284001 
coeff,  of  variation;  0.2304868357 
LOWER  QUARTILE;  14.1 
UPPER  QUARTILE;  14.1 
MEDIAN*  14.1 

trimean;  14,1 
midmean;  14.1 
Range;  19.5 

MIDRANGE*  10.35 

MEAN  absolute  deviation;  1.000530504 
INTERQUARTILE  RANGE*  Q 
coeff,  OF  skewness;  '2.609146814 
coeff,  of  kurtosis;  7,772805196 
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